Program-level control of network delay for parallel asynchronous iterative applications
نویسندگان
چکیده
Software distributed shared memory (DSM) platforms on networks of workstations tolerate large network latencies by employing one of several weak memory consistency models. Fully asynchronous parallel iterative algorithms offer an additional degree of freedom to tolerate network latency: they behave correctly when supplied outdated shared data. However, these algorithms can flood the network with messages in the presence of large delays. We propose a method of controlling asynchronous iterative methods wherein the reader of a shared datum imposes an upper bound on its age via use of a blocking Global Read primitive. This reduces the overall number of iterations executed by the reader, thus controlling the amount of shared updates generated. Experiments for a fully asynchronous linear equation solver running on a network of 10 IBM RS/6000 workstations show that the proposed Global Read primitive provides significant performance improvement.
منابع مشابه
Program-Level Control of Network Delay for Parallel Asynchronous Iterative Applications - High Peformance Computing, 1996., 3rd International Conference on
Software distributedshared memory (DSM)platfamzs on networks of workstations tolerate large network latencies by employing one of several weak memory consistency models. Fully asynchronous parallel iterative algorithms offer an additional degree offreedom to tolerate network latency: they behave correctly when supplied outdated shared data. However; these algorithms canjood the network with mes...
متن کاملCongestion Control for Asynchronous Parallel Computing on Workstation Networks
Asynchronous parallel computing can result in high message generation rates, thus triggering network congestion. This paper investigates the network congestion problem that can result from asynchronous parallel programs' high message generation rates. First, we characterize the communication requirements of a large class of supercomputing applications falling under the category of xed-point pro...
متن کاملUse of run time predictions for automatic co-allocation of multi-cluster resources for iterative parallel applications
Metaschedulers co-allocate resources by requesting a fixed number of processors and usage time for each cluster. These static requests, defined by users, limit the initial scheduling and prevent rescheduling of applications to other resource sets. It is also difficult for users to estimate application execution times, especially on heterogeneous environments. To overcome these problems, metasch...
متن کاملAsynchronous Large-Scale Graph Processing Made Easy
Scaling large iterative graph processing applications through parallel computing is a very important problem. Several graph processing frameworks have been proposed that insulate developers from low-level details of parallel programming. Most of these frameworks are based on the bulk synchronous parallel (BSP) model in order to simplify application development. However, in the BSP model, vertic...
متن کاملALTA: Asynchronous Loss Tolerant Algorithms for Grid Computing
This paper describes an environment dedicated to the building of efficient scientific applications for the Grid on top of unreliable communication networks. Nowadays, scientific computing applications are usually built on top of reliable communication protocols (such as TCP). Nevertheless, the additional cost introduced by the reliability layer is not negligible in wide area network-based grid ...
متن کامل